Studying the History of Ideas Using Topic Models
Authors
Abstract
How can the development of ideas in a scientific field be studied over time? We apply unsupervised topic modeling to the ACL Anthology to analyze historical trends in the field of Computational Linguistics from 1978 to 2006. We induce topic clusters using Latent Dirichlet Allocation, and examine the strength of each topic over time. Our methods find trends in the field including the rise of probabilistic methods starting in 1988, a steady increase in applications, and a sharp decline of research in semantics and understanding between 1978 and 2001, possibly rising again after 2001. We also introduce a model of the diversity of ideas, topic entropy, using it to show that COLING is a more diverse conference than ACL, but that both conferences as well as EMNLP are becoming broader over time. Finally, we apply Jensen-Shannon divergence of topic distributions to show that all three conferences are converging in the topics they cover.
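The two measures named in the abstract, topic entropy and Jensen-Shannon divergence, can be illustrated with a minimal sketch. It assumes each conference is summarized by a single probability distribution over topics; how the paper builds those distributions from the LDA output is not stated here, and the three-topic vectors `conf_a` and `conf_b` are made-up values used only for illustration.

```python
import math


def entropy(p):
    """Shannon entropy, in bits, of a discrete probability distribution."""
    return -sum(x * math.log2(x) for x in p if x > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence, in bits, between two distributions.

    Computed as H(M) - (H(P) + H(Q)) / 2, where M is the midpoint of P and Q.
    """
    m = [(a + b) / 2 for a, b in zip(p, q)]
    return entropy(m) - (entropy(p) + entropy(q)) / 2


# Hypothetical topic distributions for two conferences (not data from the paper).
conf_a = [0.70, 0.20, 0.10]  # concentrated on one topic
conf_b = [0.40, 0.30, 0.30]  # spread more evenly across topics

print("entropy A:", entropy(conf_a))
print("entropy B:", entropy(conf_b))
print("JS divergence:", js_divergence(conf_a, conf_b))
```

Under this reading, a higher entropy corresponds to a conference that spreads its papers over more topics, and a Jensen-Shannon divergence that shrinks from year to year corresponds to two conferences converging in what they cover.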
Similar resources
An Introduction to the History of Popular Mentality in Iranian Architecture
The architectural works that remained from the long history of Iran are indeed treasures of Iranian architecture. However, these works are not perfect manifestations of the architecture which had been realized in Iran during centuries. Most of what we have inherited from this architecture are monuments. Such majestic works can hardly lead us to the major part of the architecture, which is popul...
The discourse theory of democracy and public sphere in Habermas's ideas
This research aims to give a scientific explanation of Jurgen Habermas's discourse theory of democracy and to study and evaluate how his philosophical and epistemological principles are reflected and generalized. From this view, it studies the representation of concepts and categories such as cognitive interests, communicative action, discoursing...
Science Communication, A review on Importance, History, and Its Models
The article is an extended version of a talk on the issue given at the Half-Day Seminar on the Complications of Scientific Publications, held by the Iranian Academy of Sciences on July 9, 2019. Apparently, science communication, an old tradition that was once very popular according to the literature reviewed, has not advanced along with the magnificent scientific progress that happened during the l...
Explaining Pattern-Based Reading in Teaching Architectural History and Evaluating its Effectiveness on Architecture Students’ Ideation and Insights
This article discusses the necessity of applying historical knowledge effectively in architectural design. In other words, it asks how historical data can be an approach to enhancing students’ design insights. A review of the literature suggests that one of the challenges of teaching architecture is helping students in the process of creating new ideas. Accordingly, one of the pro...
Topic Modeling and Classification of Cyberspace Papers Using Text Mining
The global cyberspace networks provide individuals with platforms to interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and much more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...
Publication date: 2008